AITopics | q-value prediction

Collaborating Authors

q-value prediction

Information about AI from the News, Publications, and Conferences

Automatic Classification – Tagging and Summarization – Customizable Filtering and Analysis

If you are looking for an answer to the question What is Artificial Intelligence? and you only have a minute, then here's the definition the Association for the Advancement of Artificial Intelligence offers on its home page: "the scientific understanding of the mechanisms underlying thought and intelligent behavior and their embodiment in machines."

However, if you are fortunate enough to have more than a minute, then please get ready to embark upon an exciting journey exploring AI (but beware, it could last a lifetime) …

Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions

Neural Information Processing SystemsDec-26-2025, 07:58:42 GMT

We study the tradeoff between consistency and robustness in the context of a single-trajectory time-varying Markov Decision Process (MDP) with untrusted machine-learned advice. Our work departs from the typical approach of treating advice as coming from black-box sources by instead considering a setting where additional information about how the advice is generated is available. We prove a first-of-its-kind consistency and robustness tradeoff given Q-value advice under a general MDP model that includes both continuous and discrete state/action spaces. Our results highlight that utilizing Q-value advice enables dynamic pursuit of the better of machine-learned advice and a robust baseline, thus result in near-optimal performance guarantees, which provably improves what can be obtained solely with black-box advice.

black-box advice, learning-augmented algorithm, name change, (5 more...)

Neural Information Processing Systems

Industry: Transportation > Air (0.92)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Beyond Black-Box Advice: Learning-Augmented Algorithms for MDPs with Q-Value Predictions

Neural Information Processing SystemsJan-19-2025, 14:31:55 GMT

black-box advice, learning-augmented algorithm, q-value prediction, (2 more...)

Neural Information Processing Systems

Industry: Transportation > Air (0.93)

Technology: Information Technology > Artificial Intelligence > Machine Learning (1.00)

Add feedback

Smart Sampling: Self-Attention and Bootstrapping for Improved Ensembled Q-Learning

Khan, Muhammad Junaid, Ahmed, Syed Hammad, Sukthankar, Gita

arXiv.org Artificial IntelligenceMay-13-2024

We present a novel method aimed at enhancing the sample efficiency of ensemble Q learning. Our proposed approach integrates multi-head self-attention into the ensembled Q networks while bootstrapping the state-action pairs ingested by the ensemble. This not only results in performance improvements over the original REDQ (Chen et al. 2021) and its variant DroQ (Hi-raoka et al. 2022), thereby enhancing Q predictions, but also effectively reduces both the average normalized bias and standard deviation of normalized bias within Q-function ensembles. Importantly, our method also performs well even in scenarios with a low update-to-data (UTD) ratio. Notably, the implementation of our proposed method is straightforward, requiring minimal modifications to the base model.

ensemble, estimation bias, q-value prediction, (13 more...)

arXiv.org Artificial Intelligence

2405.08252

Country:

North America > United States > Florida > Hillsborough County > University (0.04)
North America > United States > New York > New York County > New York City (0.04)
North America > United States > Florida > Orange County > Orlando (0.04)
(2 more...)

Genre: Research Report (1.00)

Technology:

Information Technology > Artificial Intelligence > Machine Learning > Reinforcement Learning (1.00)
Information Technology > Artificial Intelligence > Machine Learning > Performance Analysis > Accuracy (0.64)

Add feedback